#AI Agents

41 articles

Tech Apr 7, 2026 updated 8 min

Claude Code adaptive thinking regression: 17,871 thinking blocks, 234,760 tool calls, and what changed

Breakdown of the Claude Code quality-regression issue: an AMD/IREE developer analyzed 17,871 thinking blocks and 234,760 tool calls, linking adaptive-thinking shrinkage and redaction changes to worse coding behavior.

Claude Code Anthropic AI Agents Coding Agents

Tech Apr 7, 2026 17 min

Vulnerabilities Found by Scanning 50 Open-Source MCP Servers

A security scan of 50 open-source MCP servers found 61% lacked input validation. This article covers real vulnerabilities in high-profile servers like Playwright MCP and Puppeteer MCP, and examines when to skip MCP entirely and use CLI tools directly.

Security MCP AI Agents Supply Chain

Tech Apr 4, 2026 14 min

All Claude tiers jailbroken: AFL attack and the structural failure of constitutional safety

A security researcher bypassed Claude Opus 4.6's policy evaluation with just four short prompts, generating attack code against live infrastructure. In addition, 915 files were exfiltrated from the sandbox.

Security Claude Anthropic LLM Safety Jailbreak AI Agents

Tech Apr 3, 2026 9 min

Cursor 3 turns the IDE into an agent control tower

Cursor redesigned its UI from scratch, adding parallel agent execution, seamless cloud/local handoff, and Design Mode. Here is how that changes the IDE model and how it compares with other AI coding tools.

Cursor AI AI Agents Development Environment Cloud

Tech Apr 3, 2026 7 min

OpenClaw's SSH sandbox can be escaped through a symlink, CVSS 8.8

A symlink validation bug in OpenClaw's SSH sandbox sync path lets an AI agent read or write arbitrary local files outside the sandbox. GHSA-fv94-qvg8-xqpw, CVSS 8.8.

Security OpenClaw AI Agents CVE Vulnerability

Tech Apr 2, 2026 14 min

Run multiple agents in parallel with GitHub Copilot CLI's `/fleet` command

How Copilot CLI's `/fleet` command works and how to use it: it automatically splits tasks, dispatches subagents in parallel, and schedules them while respecting dependencies.

GitHub AI AI Agents Cloud Development

Tech Mar 27, 2026 6 min

HyperAgents shows that improving the way you improve can transfer beyond coding

Meta AI's HyperAgents performs metacognitive self-correction, optimizing the improvement strategies themselves. Self-improvement appears in four non-coding domains, strategies learned in one domain transfer to another, and persistent memory emerges spontaneously.

MachineLearning AI AI Agents Open Source Research

Tech Mar 18, 2026 4 min

Holotron-12B Makes PC-Operation AI 1.7× Faster, and Unsloth Studio Lets You Tune Models Without Code

H Company's Holotron-12B uses a memory-efficient new design to lift PC-operation AI throughput to 8,900 tokens per second. Unsloth has released the beta of 'Studio,' a browser tool for no-code model fine-tuning.

AI LLM AI Agents Unsloth Local LLM

Tech Mar 10, 2026 5 min

OpenAI's Promptfoo acquisition and Microsoft's shift to a multimodel stack

OpenAI acquired AI security evaluation platform Promptfoo, and Microsoft announced that Anthropic's Claude Cowork would be integrated into Microsoft 365 Copilot. The structure of the enterprise AI market is starting to change.

OpenAI Microsoft Anthropic Claude Security Copilot AI AI Agents

Tech Mar 10, 2026 7 min

Karpathy's Autoresearch lets AI run 100 ML experiments while you sleep

Andrej Karpathy released Autoresearch, a system where an AI agent autonomously runs machine-learning experiments on a GPU and tries 100 variants overnight. The article breaks down the mechanism and design so even readers with zero ML background can follow.

AI MachineLearning LLM AI Agents OSS

Tech Feb 25, 2026 9 min

AMOS turns AI agents into a delivery vehicle via malicious OpenClaw SKILL.md on macOS

Trend Micro analyzed a new AMOS distribution method that targets AI agent workflows. A malicious SKILL.md on OpenClaw plants fake CLI install instructions and uses AI as the intermediary to manipulate people.

Security macOS AI Agents Malware Supply Chain OpenClaw

Tech Feb 24, 2026 updated 7 min

Injection Attacks on AI Agent Memory and Automated Smart Contract Exploitation with EVMbench

Covers techniques and defenses from the MINJA, InjecMEM, and ToxicSkills campaigns, which poison AI agents’ memory files, and GPT-5.3-Codex's 72% exploit success rate on EVMbench, the benchmark released by OpenAI and Paradigm. The article lays out how AI becomes both a target of attacks and a weapon for attackers.

Security AI Agents Prompt Injection MCP Ethereum Smart Contracts OpenAI Supply Chain